Blocking Black Area Method for Speech Segmentation
Abstract
Speech segmentation is an important subproblem of automatic speech recognition. This research develops a continuous speech segmentation system for the Bangla language. The paper presents a dynamic thresholding algorithm that segments continuous Bangla speech sentences into words or sub-words. It uses Otsu’s method for dynamic thresholding and introduces a new approach, named the blocking black area method, to identify the voiced regions of continuous speech. The system was evaluated on several continuously spoken Bangla sentences. To test its performance, 100 Bangla sentences containing 656 words were recorded from each of 5 (five) male speakers of different ages, so the speech database contains 500 Bangla sentences with 3280 words. All algorithms and methods used in this research are implemented in MATLAB, and the proposed system achieved an average segmentation accuracy of 90.58%. Keywords: Blocking Black Area; Boundary Detection; Dynamic Thresholding; Otsu’s Algorithm; Speech Segmentation
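The abstract does not spell out how the threshold is applied, so the following is only a minimal Python sketch of the general idea: Otsu's method picks a threshold from the data itself, and frames above it are grouped into candidate voiced regions. It is not the authors' MATLAB implementation and it does not reproduce the blocking black area method. The use of short-time energy as the feature, the frame length, the bin count, and the function names `frame_energies`, `otsu_threshold`, and `voiced_segments` are all assumptions made for illustration.

```python
import math


def frame_energies(samples, frame_len):
    """Mean squared amplitude of consecutive non-overlapping frames."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def otsu_threshold(values, bins=64):
    """Threshold maximizing the between-class variance of a histogram of values."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return lo  # all values identical: no meaningful split
    width = (hi - lo) / bins
    hist = [0] * bins
    for v in values:
        hist[min(int((v - lo) / width), bins - 1)] += 1
    centers = [lo + (i + 0.5) * width for i in range(bins)]
    total = len(values)
    sum_all = sum(c * h for c, h in zip(centers, hist))
    best_var, best_k = -1.0, 0
    w0, sum0 = 0, 0.0
    for k in range(bins - 1):
        w0 += hist[k]
        sum0 += centers[k] * hist[k]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_var, best_k = var_between, k
    return lo + (best_k + 1) * width  # upper edge of the best split bin


def voiced_segments(energies, threshold):
    """Runs of frames above the threshold, as (start, end_exclusive) frame indices."""
    segments, start = [], None
    for i, e in enumerate(energies):
        if e > threshold and start is None:
            start = i
        elif e <= threshold and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(energies)))
    return segments


def tone(n, amp):
    """Synthetic stand-in for speech: a sine with 5 cycles per 100 samples."""
    return [amp * math.sin(2 * math.pi * 5 * i / 100) for i in range(n)]


# Synthetic signal (assumption): quiet, loud, quiet, loud, quiet.
signal = (tone(400, 0.01) + tone(800, 1.0) + tone(400, 0.01)
          + tone(400, 1.0) + tone(400, 0.01))
energies = frame_energies(signal, frame_len=100)
threshold = otsu_threshold(energies)
segments = voiced_segments(energies, threshold)
print(threshold, segments)
```

On this synthetic signal the two loud stretches come out as two separate frame ranges. Real speech would need overlapping frames, smoothing, and a rule for merging or dropping very short runs, none of which is specified in the abstract.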
Similar resources
Training Set of Data Bin for Small Black Pixels Neighborhood Recognition of Each Boundary
We first describe how to “fuzzify” the estimated binary columns to create a [0,1]-valued column. We refer to this [0,1]-valued column as the soft segmentation column of the noisy spectrogram column. Similarly, we refer to the collection of soft segmentation columns as the soft segmentation image, or simply as the soft segmentation. The band-dependent posterior probability that the hard segmentation columnv...
Phonetic segmentation using multiple speech features
In this paper we propose a method for improving the performance of the segmentation of speech waveforms into phonetic segments. The proposed method is based on the well-known Viterbi time-alignment algorithm and utilizes the phonetic boundary predictions from multiple speech parameterization techniques. Specifically, we utilize the best, with respect to boundary type, phone transition position pre...
Quantification of area percentage of immunohistochemical staining by true color image analysis with application of fixed thresholds.
Most image analysis systems (IAS) use black-and-white cameras. However, true color IASs are considered to be useful for quantification of immunohistologically stained structures. Using a true color IAS, we evaluated two methods of segmentation for quantification of area percentage of staining: one using fixed, preset thresholds and one using thresholds interactively set per image. Furthermore, ...
Word segmentation in Persian continuous speech using F0 contour
Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...
Evolutionary Algorithm for Speech Segmentation
Speech segmentation is one of the problems in the speech processing area. The main techniques that attempt to solve it are manual segmentation and hidden Markov model alignment. This work presents a new technique based on an evolutionary algorithm that segments speech without a prior training process.
Publication date: 2015